Verb Classification – Machine Learning Experiments in Classifying Verbs into Semantic Classes
نویسندگان
چکیده
This paper presents the results of our machine learning experiments in verb classification. Using Beth Levin’s semantic classification of the English verbs as a gold standard, we (i) test the hypothesis that the syntactic behavior of a verb can be used to predict its semantic class, and (ii) investigate whether a robust shallow parser can provide the necessary syntactic information. With 277 verbs belonging to six of Levin’s classes, we do type classification experiments using RIPPER, an inductive rule learner. Having only a set of n most likely subjects or objects as features, this machine learning algorithm is able to predict the correct class with ± 58% accuracy. This result is comparable with results from other researchers, like Merlo and Stevenson, Stevenson and Joanis, and Schulte im Walde.
منابع مشابه
رشد جنبه معنایی فعل در کودک فارسیزبان: مطالعه طولی
Objective Learning “verb” as one of the main components of sentence, has been always a debatable topics in the process of language learning. One of the important issues in “verb” learning is determining its meaning using syntactic clues and learning its semantic aspects. Therefore, the main objective of this study was to examine the development of the semantic aspect of ...
متن کاملComputational Lexicography and Lexicology A Large-Scale Extension of VerbNet with Novel Verb Classes
Lexical classifications have proved useful in supporting various linguistic and natural language processing (NLP) tasks. The largest verb classification in English is Levin's (1993) work. VerbNet (Kipper-Schuler 2006) the largest computational verb lexicon currently available for English provides detailed syntactic-semantic descriptions of Levin classes. While the classes included are extensive...
متن کاملAutomatic Verb Classification Using Distributions of Grammatical Features
We apply machine learning techniques to classify automatically a set of verbs into lexical semantic classes, based on distributional approximations of diathe-ses, extracted from a very large annotated corpus. Distributions of four grammatical features are sufficient to reduce error rate by 50% over chance. We conclude that corpus data is a usable repository of verb class information, and that c...
متن کاملClassifying Arabic Verbs Using Sibling Classes
In the effort of building a verb lexicon classifying the most used verbs in Arabic and providing information about their syntax and semantics (Mousser, 2010), the problem of classes over-generation arises because of the overt morphology of Arabic, which codes not only agreement and inflection relations but also semantic information related to thematic arity or other semantic information like ”i...
متن کاملSupervised Learning of a Probabilistic Lexicon of Verb Semantic Classes
The work presented in this paper explores a supervised method for learning a probabilistic model of a lexicon of VerbNet classes. We intend for the probabilistic model to provide a probability distribution of verb-class associations, over known and unknown verbs, including polysemous words. In our approach, training instances are obtained from an existing lexicon and/or from an annotated corpus...
متن کامل